Discovered Rule Filtering Using Information Retrieval Technique
نویسندگان
چکیده
A data mining system can semi-automatically discover knowledge by mining a large volume of data, but the discovered knowledge is not always novel and interesting to the user. We propose a discovered rule filtering method to filter rules discovered by a data mining system and to produce ones that are novel and interesting to the user by using information retrieval technique. In the method, we rank discovered rules according to the result of information retrieval from the Internet. In this paper, we show the steps of discovered rule filtering by using a concrete example of clinical data mining and MEDLINE document retrieval. Preliminary results show that this method has merits in not only filtering discovered rules but also providing a new viewpoint to the rules to give a chance to invoke a new data mining process.
منابع مشابه
Micro View and Macro View Approaches to Discovered Rule Filtering
A data mining system tries to semi-automatically discover knowledge by mining a large volume of raw data, but the discovered knowledge is not always novel and may contain unreasonable facts. We try to develop a discovered rule filtering method to filter rules discovered by a data mining system to be novel and reasonable ones by using information retrieval technique. In this method, we rank disc...
متن کاملPreliminary Evaluation of Discovered-Rule-Filtering Methods
Data mining systems semi-automatically discover knowledge by mining a large volume of data, but discovered knowledge is not always novel to users. We discuss a discovered rule filtering method to filter rules discovered by a data mining system into ones that are novel to the user by using information retrieval results from the Internet. We have two methods; the micro view and the macro view met...
متن کاملUsing a Spring Embedding Algorithm to Display Term Relationships from a Medical Concept Discovery System
Visualization can make explicit the subtle and interdependent relationships that exist among terms in a broad topical area. Such visualizations may help in refining retrieval or enhance the user’s understanding a complex area. We will demonstrate an application consisting of a term generation service, UMLS categorization of the terms, and a spring embedded visualization of the term set relation...
متن کاملRelations between Terms Discovered by Association Rules
This paper presents an approach to automatic knowledge base construction used for automatic query expansion. We have designed a system that is able to gain domain knowledge from analysed texts. We have used a Data Mining technique called association rules. We show how this technique is able to discover knowledge about relations between terms. We study how these relations can be used to improve ...
متن کاملA New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
متن کامل